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We study the non-equilibrium dynamics of observables for finite one-dimensional quantum sys- 
tems. By calculating the dependence of time and statistical averages on system size we investigate 
the role of the local and the exponentially many non-local conserved quantities for thermalization. 
We derive, in particular, a Mazur-type equality to split observables into a local and a non-local 
part with time and statistical averages for the local part being identical by construction. We then 
demonstrate that fluctuations in the non-local part are, in general, responsible for thermalization 
irrespective of whether or not the much stricter condition posed by the eigenstate thermalization 
hypothesis holds. 
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Introduction Starting a classical many-body system 
from a typical initial configuration we expect that the sys- 
tem equilibrates at long times so that typical observables 
become time independent. If the ensuing dynamics is er- 
godic, then the ergodic theorem [IH2] tells us that we can 
replace the time average of observables by an ensemble 
average. The ensemble provides a probability measure p 
on phase space which has to be invariant under time evo- 
lution because it describes the stationary state reached 
after the system has thermalized. The probability mea- 
sure thus has to be a function of the conserved quantities 
Qj with {H, Qj} = where {., .} is the Poisson bracket 
and H the Hamilton function of the system. In cases 
where rl is the only independent conserved quantity this 
invariant phase space measure is the familiar microcanon- 
ical ensemble which becomes equivalent to the canonical 
one in the thermodynamic limit (TDL). Integrable sys- 
tems with phase space dimension 2N, on the other hand, 
have N independent conservation laws making them non- 
ergodic and restricting the motion in phase space to in- 
variant tori. Importantly, the Kolmogorov-Arnold-Moser 
(KAM) theorem allows to describe the consequences of 
small integrability breaking perturbations on a quantita- 
tive level HIS]. 

Although recent theoretical [SHIT] and experimental 
[T^ - frS] studies have lead to new interesting insights, it 
is still fair to say that the non-equilibrium dynamics of 
a quantum system is much less well understood. One 
example, which has been studied numerically [5], is the 
expansion of a bosonic lattice gas into a larger empty 
lattice. In addition, also a number of other quantum 
quenches in lattice models with short-range interactions 
have been studied [9, 16-24 . In all cases the initial en- 
ergy distribution becomes singly-peaked in the TDL with 
vanishing width [5]. Thermalization in these models has 
often been explained by the eigenstate thermalization hy- 
pothesis (ETH) [6-8, 25_ which assumes that the expec- 
tation values of the considered operator in the eigenstate 
basis fluctuate little between eigenstates close in energy 



and can thus be directly replaced by an ensemble average. 

In this letter we show that thermalization, in principle, 
can also occur in a more generic scenario where fluctua- 
tions between eigenstate expectation values close in en- 
ergy are large. We derive a criterion for these fluctuations 
which has to be fulfilled if the system thermalizes. 

We restrict our discussion to time-independent Hamil- 
tonians H with short-range interactions in position space 
and a discrete spectrum. Position space, however, is by 
no means special: For a Hamiltonian with e.g. local in- 
teractions in momentum space the same approach can be 
used and the results directly carry over. The time aver- 
age of an observable O for an initial pure state |^o) is 
defined by 

0= lim 1 / dt(y \e lHt Oe- lHt \y Q ). (1) 

r^oo T J Q 

By using a spectral representation of the observable O 
and assuming that the spectrum of H is non-degenerate 
|45j we immediately obtain 

0=(0) diag = (*o|Odiag|* ) = J2°nn\(^0\n)\ 2 (2) 

= \c{nW 

where O d ia g = Y^ n O nn P n . Here O nn = (n\0\n) with 
Pn = \n) (n\ being the projection operator onto the eigen- 
state \n) and H\n) — e n \n). Off-diagonal elements of O 
do not contribute in time or statistical averages, i.e., to 
address thermalization it is sufficient to consider the di- 
agonal part. 

If we want to replace the time average by an ensemble 
average following the prescription in the classical case, 
we immediately encounter an important aspect which is 
different from the classical problem: All the projection 
operators are conserved, [H, P n ] = 0, and the number 
of conserved quantities thus always equals the Hilbert 
space dimension D. A density matrix pdiag which yields 
a statistical average equal to the time average Q thus 
apparently has to be a function of all the projection op- 
erators [25], e.g., 
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Pdiag = exp(- ^ ^nP^/Zdmg (3) 
n 

with Zdiag = Tr pdiag and Lagrange multipliers A„ ful- 
filling the condition e~ Xn /Z diag = |(^o|^)| 2 - While it is 
easy to see that this ensemble indeed fulfills Tr(Opdiag) = 
(O)diag = O, see Eq.([2j), by construction and thus naively 
proves the ergodic theorem in the quantum case, it de- 
pends on the initial state and fixes the microstate of the 
system up to phase factors which are irrelevant for the 
long-time dynamics. In quantum statistical mechanics we 
are, however, only interested in the macrostate consist- 
ing of many microstates which cannot be distinguished 
by local probes (2"5] , 

The new important ingredient, which has to be taken 
into account in the quantum case, is the distinction be- 
tween local and non-local conserved quantities. A local 
operator for a lattice model is defined as Q n = ^ . qj 
where g" acts on lattice sites j, j + 1, • • ■ ,j + n only, with 
n finite. In a field theory this then becomes a fully local 
operator Q n — J drq n (r) . In the TDL we expect that 
only the local conserved quantities are needed in the den- 
sity matrix to describe the thermalized state. This num- 
ber is usually finite for a generic quantum system while it 
increases linearly — but not exponentially! — with system 
size for an integrable one-dimensional model (271 12"5] . Ex- 
periments on cold atomic g well as most numerical 
studies of the thermalization problem are, however, done 
on finite systems [HI Q30 where a distinction between 
local and non-local conservation laws, strictly speaking, 
does not exist. Understanding the scaling with system 
size of the local and non-local contributions to the en- 
semble is therefore a problem of practical relevance. 

General considerations: A Mazur-type equality To 
obtain a condition for thermalization on the microscopic 
level of individual eigenstates we write 

= lim AO N = lim [(0) diag - (0) th ] (4) 

N— s-oo iV— »oo 

" ^ ' |c(")| 2 P5T 

^4°° J deO(e)[\c(s)\ 2 u(e)-p th (e)u( £ )} 
r di a g (e) r th ( £ ) 

where (0)th= Tr{Op t h} with p th = exp(- S n A„Q„)/Z, 
Z = Tr pth being the appropriate thermal density matrix 
including all the local conserved quantities Q n with La- 
grange parameters {A„} determined by the set of equa- 
tions (*o|Q„|# ) = Tr{Q„pth} [21,. Furthermore, we 
have rewritten the matrix elements O nn in the generic 
form of a projection. In the last line we have introduced 
a coarse-grained description in the thermodynamic limit 
with 0(e) = Y,e-de<e n <e-de°nn and similarly a ther- 
mal energy distribution r t h(e) and initial energy distri- 
bution rdiag(e) both including the coarse-grained density 



of states v(e). The assumption of the ETH scenario is 
that O nn becomes a smooth function of energy e n in the 
TDL, i.e., 0{e = e n ) = O nn so that a coarse graining is 
not required. One of the main points of our study is that 
this is unnecessarily restrictive. Note that both the initial 
energy distribution rdi ag (e) as well as the thermal distri- 
bution r t h(e) will become sharply peaked in the TDL so 
that only a small energy window will contribute to the 
integral in the last line of Eq. Q . In order to investigate 
thermalization in more detail we explain in the following 
how to separate an observable into a part proportional 
to the local conserved quantities and an orthogonal part. 

The distinction between local and non-local conserved 
quantities has turned out to be essential for quantum 
transport. Here it is known that a local or quasi-local 
|30| conserved quantity can protect a current from decay- 
ing while a non-local conserved quantity cannot |31H34| . 
This is understood by using a Mazur equality pUJ HSU [35] 
where the current operator is written as a sum of pro- 
jections onto all the conserved quantities of the system. 
In exactly the same way, we can project the considered 
observable onto the local and non-local conserved quan- 
tities to study thermalization. Note that this is nothing 
but a simple basis change in operator space. Instead of 
using P n = |n)(n| as the basis we use the / many local 
conserved quantities which can be written as 

D 

P n = Q n = J2 a ? p j (n = 0,--- (5) 

with coefficients a" plus D — f many non-local operators 
P n , n = /,••• ,D - 1 such that {P n P m )th = (-P,?)tiAm 
are orthogonal. Here we have choosen (• • • )th = 
Tr{---pth} as the natural inner product in the vector 
space of Hermitian operators. 

To be concrete we consider the case that the system 
has only a single local conservation law — the Hamiltonian 
itself which can be written as H = Y^j e jPj- The thermal 
density matrix p t h = exp(—H/T)/Z is then just the usual 
canonical ensemble expected to describe the system at 
long times in the TDL. As a first step we choose the 
set of normalized operators {H, i\, P2, ■ ■ ■ , Pd-i} where 
H = H/^fW), P n = Pn/^/Wt with Pi = P n . We 
now define the normalized operator 

U = (H-P D )/yJ((H-P D )*) (6) 

so that 

R(H) = H- 2U(UH) = P D (7) 

where Pjj is the projection operator which we have re- 
placed by H . Applying the same orthogonal transforma- 
tion (a so-called Householder rotation) onto the projec- 
tion operators 

R(Pi) =Pi- 2U{UP t ) = Pi (8) 
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for i — 1, 2, • ■ ■ , D — 1 thus generates the required or- 
thonormal set {P = H,P U P 2 ,--- ,-Pd-i}- With the 
help of this new basis in operator space we can split Odiag 
into a local and a non-local part 



0.025 



^diag — 



(OH) 



D-l 



H 



(9) 



Energy conservation during time evolution demands 
(^ol-ffl^o) = (H)th which fixes the temperature T 
[HI HOI IE] and guarantees the equivalence of the diag- 
onal and canonical ensemble average for the first term, 
Oioc in ([£]), which is proportional to H. Eq. Q can thus 
be rewritten as a condition for the non-local part only. 

We are typically interested in observables O, e.g. cor- 
relation functions, which are not affected by an energy 
shift H' = H — E . In this case we can simplify the con- 
dition on the non-local part further by demanding that 
(Ononioc)th = 0. For this gauge we have to choose the 
reference energy 



En 



(OH) th (H) th - (0) th (H 2 ) th 



(OH) th - (0} th (H) 



th 



(10) 



The condition for thcrmalization (El then reads 



D-l 

= Jim (O„onioc)diag = Jim V" AOjv(n) 



O 



AOjs 



(11) 



where the last line defines the diagonal matrix elements 
O nn in the new operator space basis. Using this new ba- 
sis we have achieved a separation into a part Oi oc with 
(Oi oc )diag = (Oi oc )th by construction and a part, O non i oc , 
for which the agreement between time and statistical av- 
erage is non-trivial. 

The model So far, our discussion has been general. 
To test the concepts we have introduced above, we want 
to study in the following a specific lattice model, the one- 
dimensional anisotropic Heisenberg model 



jv 



JV 



H(A,J 2 ) = ■ J^hj,,. (12) 

i=i j=i 

1 



h.c.) + ASfS], 



where S is a spin- 1/2 operator, J (J2) are the superex- 
change couplings for the nearest (next-nearest) neigh- 
bors, respectively, and A parametrizes an exchange 
anisotropy. In the following we set J = 1, use periodic 
boundary conditions, and study the model by exact di- 
agonalization taking spin and spatial symmetries into ac- 
count |37]. The model (12 1 is integrable for J 2 — 0. In 
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FIG. 1: (Color online) (a) Quench with |* (5, 0.2)) and 
#(1,0.2). Expectation values (SiSi+i) using different ensem- 
bles: (1) Difference between canonical and diagonal ensemble 
and linear fit, (2) between canonical and extended ensemble 
pp 1 and exponential fit, (3) between pp and the diagonal 
ensemble and the difference of the two fits, (b) Difference 
of canonical and diagonal ensemble average for an integrable 
quench with |* (5,0)} and #(1/2,0). 



this case the number of conserved local operators Q n in- 
creases linearly with system size N. In the non-integrable 
case J 2 7^ 0, H itself and S^ ot = ■ 5| are the only 
conserved local operators. To make contact with pre- 
vious thermalization studies, we consider a quench sce- 
nario [381 155] . As initial state we choose the ground 
state |^q(A, J 2)) of the Hamiltonian |l2"| ) with parame- 
ters A and J 2 . We then time evolve with H(A, J2) where 
(A,J 2 )^(A,J 2 ). 

Locality and statistical ensemble We start by study- 
ing the role played by non-local conserved quantities for 
the statistical ensemble. To do so we define an extended 
canonical ensemble 



PP] = cxp(-f3H + XjPj)/Z, 



(13) 



where the two parameters j3, X 
(*o|ff|*o) = Tr{ifpp.} and 



*o 1^-1*0) 



are determined by 

= 1V{ /',,/-;. |, 

This extended ensemble is compared with the canonical 
and the diagonal ensemble in Fig. [l] for the generic case 
j = 1 with Pi as defined in Eq. We have found that 
the qualitative results for the finite size scaling are in- 
dependent of which of the Pj we choose in the extended 



ensemble (13 1 and consistent with the following: (a) The 



change in the average of an observable caused by includ- 
ing an additional non-local conservation law vanishes ex- 
ponentially with system size, (b) The contribution of all 
non-local conservation quantities to the ensemble average 
vanishes linearly in 1/N. This follows from the fact that 
the canonical ensemble is obtained from the diagonal one 
by neglecting all non-local quantities, see Eq. |3|. For a 
finite system the non-local conserved quantities do, how- 
ever, contribute showing that the TDL is essential for a 
full thermalization. In the integrable case shown in the 
inset of Fig. [T] the canonical ensemble does not describe 
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FIG. 2: (Color online) Longer-range correlations for the same 
quench as in Fig. [TJa). Shown is the scaling of the relative 
deviation between ensemble averages A re i for (a) \i — j\ = 
1, 2, 3, 4, 5 (from bottom to top) and (b) for \i-j\ = N/2. 



the equilibrated state at long times because other local 
conserved quantities are not taken into account |40| . 

Locality and observables Next, we want to study how 
the amount of locality of the operator itself affects its 
thermalization. Numerical data for O = (SiSj), which 
is a two-point correlation function of a fully local op- 
erator, are shown in Fig. [2] The relative deviation 



A ld = \({SiS 



j/diag 



-(SiS 



i *-* j I can 



)/(SiS. 



j/diag 



between the 



long-time mean (diagonal ensemble) and the canonical 
ensemble average becomes larger the larger the distance 
is and thus the less local O is, see Fig. [2ja) . If we fix 
the distance between the spin operators to N/2 as in 
Fig. [2jb) then the canonical ensemble average approaches 
zero with increasing N much faster than the long-time 
mean so that A re i — > 1. Thus not only the initial distri- 
bution (4> |-Pn|*o> ~ (*o|-Pn|*o), see Suppl. Mat., but 



also the operator-dependent first term in Eq. (Ill, O nn , 
has to play a crucial role in the thermalization process. 

Projection onto locally conserved quantities To study 
the role of locality of the operator in more detail we now 
use the splitting into a local and a non-local part denned 
in Eq. The diagonal distribution function |c(e„)| 2 , 
and the microcanonical and canonical ones, r miC/ / can (e), 
for the same quench as before are shown in Fig. |3ja). 
While the diagonal distribution is expected to become 
singly peaked in the thermodynamic limit [S], this is 
clearly not the case for the system sizes within reach of 
exact diagonalization. The initial state distribution thus 
cannot be simply replaced by the microcanonical ensem- 
ble. Furthermore, O nn , shown in Fig. [3^bl) and (b2) 
for two different correlation functions, shows large fluc- 
tuations. Thus ETH is not directly applicable, yet we 
already see clear indications for thermalization as shown 
in Fig. [1] and Fig. [2] 

To understand these findings we return to the more 



generic condition for thermalization, Eq. (Ill, which only 
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requires that the sum of the fluctuations in the non- 
local part averages out to zero. Indeed, O nn , shown in 



FIG. 3: (Color online) Same quench as in Fig. Qa) for N = 
16. Shown are: (a) the initial, microcanonical and canonical 
energy distribution functions, and results for O — SiSi+i in 
(b) and O = SiSi+4 in (c). (bl) and (cl) show O n „ while 
(b2) and (c2) show O 



Figs. [3^b2) and (c2), has much smaller fluctuations cen- 
tered around zero and both correlation functions display 
similar behavior once the local part is separated off. Al- 
though we see clear evidence for thermalization, no con- 
clusive finite size scaling for the fluctuations in O nn could 
be obtained so that the question whether or not the ETH 
holds in the considered case has to remain open. For 
O nn , however, we do obtain a clear finite size scaling (see 
Suppl. Mat.) showing that O nn — > for all n in the 
TDL. Thus every single term in the sum in Eq. (JTlJ be- 
comes zero so that the projections of the observable onto 
the nonlocal conserved operators can be ignored allowing 
the system to thermalize to the canonical ensemble. 

Conclusions To summarize, we have looked at the 
question of thermalization in closed quantum systems 
from the perspective of conservation laws. Contrary to a 
classical system two distinct kinds appear: local and non- 
local ones. By choosing a new basis in operator space we 
have separated the considered observable into a projec- 
tion onto the locally conserved quantities, for which the 
time and statistical average agree by construction, and a 
projection onto the non-local ones where thermalization 
takes place. The condition for thermalization which is 
obtained by this separation involves fluctuations in the 
non-local part only and is more general than ETH. This, 
in principle, opens up the possibility that systems exist 
which do thermalize although the ETH does not apply. 

The authors thank F.H.L. Essler, B. Fine, 
A. Polkovnikov, and M. Rigol for discussions and 
acknowledge support by the Collaborative Research 
Centre SFB/TR49 and the Graduate School of Excel- 
lence MAINZ. 
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Supplementary material 

The purpose of our supplementary material is twofold. 
In Sec. [] we will first provide some additional information 
about the basis rotation required to split an observable 
into a local and a non-local part, numerical data test- 
ing the ETH which proved to be inconclusive, as well as 
some technical details about the coarse graining proce- 
dure used to obtain a continuous density of states. Sec- 
ondly, we also provide some additional information about 
what happens in the case of integrable systems in Sec. [] 



Details of the calculation 

In Sec. [] we give some details which are helpful to un- 
derstand the splitting of an observable into a local and 
a non-local part in a more general context. In Sec. [] we 
present a finite size scaling analysis of the fluctuations in 
O nn and O nn . Finally, we explain in Sec. | the coarse 
graining procedures used to obtain the continuous distri- 
butions. 



Splitting into local and non-local part 

In the paper we have argued that the proper basis in 
operator space to study thermalization consists of the lo- 
cal conserved operators plus an additional set of non-local 
operators which span the rest of the operator space. In 
this basis the time and statistical average for the part of 
the observable obtained by projecting onto the local con- 
served operators agree by construction so that thermal- 
ization becomes a statement about the projection onto 
the non-local conserved operators only. 

Choice of inner product As in Eq. (9) of the paper we 
can write the diagonal part of an arbitrary observable O 
for the case that H is the only local conserved operator 



dias " <ff 2 > h m " 



O non l oc 



8 IP- 



J In the pa- 
Tr{ - ■ • p th } as in- 



with (HP n ) = and (P n P m ) 
per we have chosen (■ • • ) = (• ■ ■ )th 
ner product in the vector space of Hermitian operators. 
While this is the natural choice, it is also possible to use 
any other proper inner product in operator space. The 
splitting into a local and a non-local part thus depends 
on the choice of the metric. This is, of course, expected 
since the very notion of distances requires the definition 
of a metric first. 

Using a general inner product and projection opera- 
tors P n which are orthogonal with respect to this metric, 



Eq. (11) of the paper can be written in the more general 
form 

= lim AO N = lim [{0} dias - {0} th \ (15) 

N—t-oo N^-oo 



D-l 



lim > 

JV-s-oo ^— ' 



(OP n ) 




• Hi 



We want to stress that AOn does not depend on the 
choice of the inner product. It is only once we start 
considering the contributions on a microscopic level that 
this choice becomes relevant. 

Energy shift Apart from the inner product there is 
another 'gauge degree of freedom' in the second line of 
Eq. (15). By shifting the energy, H' = H — En, the pro- 



jection operators P n are modified because of the orthog- 
onality condition (HP n ) = 0. The qualitative behavior 
of O nn is, however, not affected. A convenient, unique 



gauge is obtained by demanding that O nn (P n 



■ih 



0. 



For a general metric 



this is achieved by choosing 



En = 



(OH)(H)-(0)(H* 



(OH) + (0)(H) th -2(0) th (H) 



(16) 



for a given Hamiltonian H and operator O. By setting 
(• • • ) = Tr{- • • pth} this expression simplifies to Eq. (10) 
of the paper. 

Relation between the original and the rotated basis 
Once an inner product has been chosen, the new basis 
consisting of the / local conserved quantities and D — f 
non-local operators can be orthonormalizcd by using the 
Householder reflection given in the paper. To be con- 
crete, we choose again (■ ■ ■ ) = Tr{- • • p t h} as the inner 
product and assume that H is the only local conserved 
quantity. The relation between the old and new operator 
basis can then be expressed as 



D 



D 



Pm = < P « = <V( P n)thPn (m = !,-■■, D-\) 



71=1 



n=l 



(17) 

( i4 ) where P n = P n /\f(Pjhh with P% = P n are the nor- 



malized projection operators, (PiPj)th 



Using the 



definition of the Householder reflection and some simple 
algebra we find 



((h-PdK 



til- 



th 



(18) 

While ({H - P D ) 2 ) th cx 0(1) we have ^{P^ cx e~ N / 2 
so that the expansion coefficients a™ 1 are sharply peaked 
at n = m. As a consequence, the initial distribution is 
not affected by the rotation in the thermodynamic limit, 



*o|^n|*o) "4°° (^ol^nl^o) (n = 1, ■■■ ,D- 1) (19) 
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and becomes sharply peaked in the thermodynamic limit. 
The matrix elements of the observable 



10 F 



0„ 



(OPm) th 



Pn)thO 



(20) 



are. however, changed because they are given by sum- 
ming over the exponentially many old matrix elements 
O nn so that the exponentially small corrections in a™, 



see Eq. (18), still matter. 



Fluctuations in O nn and O nn and the eigenstate 
thermalization hypothesis 

According to the eigenstate thermalization hypothesis, 
O nn should become a smooth function of the eigenenergy 
e n in the thermodynamic limit. For the system sizes we 
can exactly diagonalize we are clearly far from that limit 
and fluctuations in O nn are large. Nevertheless, we can 
check how these fluctuations scale with system size. 

We define the average size of the fluctuations in an 
interval: 



\O nn - (O 

Tin/ mc I 



(21) 



The prime on the sum refers to a restriction to an energy 
interval of 0.05 times the bandwidth Wa = £d — £i 5 and 
centered on the middle of the spectrum, E = Wa/2 + 
£i, with £i the ground state energy. This can be more 
precisely defined as 



r( £ ) 



D 

= £ 

n=l 

= 6\e- 



■ T(e n — E), where 

0.05VF A ] - [e - 0.05W A ] (22) 



and 0(e) is the Heaviside function. (O nn ) mc is the lo- 
cally defined average, in other words the microcanonical 
ensemble, calculated here with the same energy window: 
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FIG. 4: (Color online) Quench with |* (5, 0.2)} and H(l, 0.2). 
Shown is a comparison of (a) Aq, (b) Ao, and (c) Ao/{0)e 
for different system sizes from N = 8 to 16. Plotted are the 
observables O = SiSj with \i—j\ = 1 (black circles), \i—j\ = 2 
(red squares), \i — j\ =3 (green triangles), \i — j\ =4 (blue 
diamonds), and \i — j\ = 5 (purple triangles). Aq shows clear 
exponential scaling to zero (note the logarithmic scales). 



We consider again the same quench as in the paper 
with |^o(5,0.2)) and H (1,0.2), and look at observables 
O = SiSj for different \i - j\. In Fig.|i]we plot Aq and 
Ao for system sizes N = 8 to 16. The absolute magni- 
tude of the fluctuations of AO , FigQb) , can be several 
orders of magnitude larger than their average value even 
for N = 16, see Fig. Qc), so that ETH can not yet be 
said to apply. In particular it does not seem possible 
that this can be the explanation for the data of Fig. 2(a) 
of the paper. Though the trend for the fluctuations ap- 
pears to be for them to become on the whole smaller for 
larger system sizes, no clear-cut scaling can be seen. Fur- 
thermore, the relative size of the fluctuations, defined as 
Ao / {0)e as in Ref. [H] , also shows no conclusive scaling 
with the system size, see Fig.Wc). However, for the non- 
local part Ao, shown in Fig.Wa), one sees clear scaling 
to zero which depends exponentially on the system size, 
supporting the division into local and non-local operators 
which we have implemented. 



To compare the size of the fluctuations with the magni- 
tude of the operator we define (O) E = Y^' n Onn- We can 
define the same in the rotated basis: 



Ao = £|0„ 



(Onn) r 



(24) 



with the interval for the sum defined as for Eq. (21 1. 
Strictly speaking (O nn ) mc is no longer the microcanon- 
ical ensemble average as n no longer labels the eigenen- 
ergies. Nonetheless one can define an analogue and we 
retain the same notation for ease of presentation. 



Energy distributions and coarse graining 

In order to plot the continuum energy distributions a 
coarse graining is necessary. The density of states is first 
made continuous by approximating 

v ( £ ) = ^2 S ( £ - £ «) ~ £%w( £ - e n ) , (25) 

n n 

with an envelope function: 
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FIG. 5: (Color online) Coarse grained density of states, i/(e), 
for the Hamiltonian H(l, 0.2) with N = 16. The coarse grain- 
ing width is W = 10(5, where 5 is the mean level spacing. 
Shown is the result after coarse graining (red circles), and the 
result after an additional running average (blue line). 
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FIG. 6: (Color online) Longer-range correlations for the inte- 
grable quench. Shown is the scaling of the relative deviation 
between ensemble averages A re i for \i — j\ — 1, 2, 3, 4, 5 (from 
bottom to top). 



Xw(e) 



-e 2 /(2W 2 ) 



(26) 



In the paper we have used W = 10(5 for N — 16, where 
<5 is the mean level spacing with an additional running 
average. The results of these procedures for the density 
of states are shown in Fig. [5] The same procedure is per- 
formed for the canonical ensemble. As a check that this 
is working correctly one must compare operator averages 
found with these coarse grained distributions and with 
the exact ones. Note that whilst a coarse graining over 
a wider energy range (e.g. W = 50(5) will give the same 
result for the density of states as in Fig. [5] it does not 
give accurate results for the canonical ensemble. For the 
microcanonical ensemble one simply broadens the delta- 
function around the initial energy E = (^o\H\^o), 

r mic (e) w 0(e - E + W/2) — 6(e — E — W/2) , (27) 

where, again, 9(e) is the Heaviside function. 

In principle one could also attempt this procedure on 
the diagonal ensemble. However for the system sizes we 
are able to consider we find that it is not possible to 
smoothen the diagonal ensemble and, at the same time, 
retain accurate averages for physical quantities. 



The integrable case 

Non-equilibrium dynamics in integrable systems and 
the question of the appropriate statistical ensemble to 
describe the long-time limit have been intensely stud- 
ied in recent years. [HJ HOI |42 44 In our paper we only 
briefly touched upon this issue by explaining how the 
splitting into a local and a non-local part can be gener- 
alized to the integrable case. While for non-interacting 



systems the additional local conservation laws simply be- 
come the occupation numbers of the diagonal modes they 
are quite complicated in the interacting case [28 and a 
detailed study is beyond the scope of this letter. Here 
we simply want to look for indications of integrability 
when taking only the Hamiltonian itself into the local 
part, as in Eq. (9) of the paper, thus ignoring all other 
locally conserved quantities. In Fig. [6] we present the 
same data as in Fig. 2 of the paper but for the integrable 
quench with |*o(5, 0)) and #(1/2, 0). The scaling behav- 
ior is now quite different. A re i for the correlation function 
(SiSi+2), in particular, shows an upturn for the largest 
system sizes indicating that the canonical ensemble does 
not describe the system at long times. To understand 
the scaling of the different correlation functions in more 
detail, an analysis of the overlap of each operator O with 
the local conserved quantities would be required. 

Finally, we present in Fig. [7] the analogue of Fig. 3 
of the paper for the integrable case. One of the obvi- 
ous differences is that the fluctuations in O nn for the 
case O = SiSi+i are substantially larger than in the 
non-integrable case, see Fig. (7jbl). In O nn , shown in 
Fig. |7|b2) and (c2), the largest fluctuations are of simi- 
lar magnitude as in the non-integrable case shown in the 
paper, however, substantial fluctuations persist to much 
higher energies. This could possibly be related to the 
different energy level distributions. While level repul- 
sion leads to a Wigner-Dyson distribution for a generic 
model, each state in an integrable model is uniquely char- 
acterized by the quantum numbers of the local conserved 
quantities so that states can cross. This leads to a Poisso- 
nian distribution and might be responsible for the differ- 
ent structure of fluctuations in Fig.|7];b2) and (c2). The 
fact that we are missing local conservation laws when 
using the canonical ensemble for an integrable system 
would, of course, be immediately obvious if we choose 
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FIG. 7: (Color online) Integrable quench for N = 16. Shown 
are: Shown are: (a) the initial, microcanonical and canonical 
energy distribution functions, and results for O — SiSi+i in 
(b) and O = SiSi+4 in (c). (bl) and (cl) show O nn while 
(b2) and (c2) show 6 



one of the additional local conserved operators as our 
observable whose expectation value would be time inde- 
pendent and thus characterized by the initial microstate. 
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